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ABSTRACT 


Let M = £Q54,, & FD be a finite automaton over the alphabet 
S ={1,...,k f. A state q € Q is a dead state iff q f F and 
Sg.d=9 vee fx. Let - be a mapping from $* onto the non-negative 
integers defined by Je 0 CA is the empty string) *L= RX 4K, Kes aes. 
Define (1 (n}= Hleaeh : [OSX sn} and A,(n= toe A: Pla)- nf If A is 
regular let M, be the minimal automaton recognizing A. Each automaton 
M induces a Markov process obtained by considering the inputs to be 
generated by independent rolls of a k-sided fair die. Let p(M) repre- 
sent the probability of being in a final state. Let p(A) = p(M, ). The 
following are proved: 1») BO senda 59, 2) we ~~ Oo , A regular 
=> p(A) = 9; 3) p(A) =0 => M, has the dead state as the only 


absorbing state; 4) Ve 0) 4 a regular set AIM, has a dead state 


A 
and AalNtd) 5 9 sere R-e<och ; 5) xs p(A) = 0, then Aa(n+1) 
Ag (n) Aan) 
cannot converge to k. With k = 2, these results prove that there is no 
a 
regular set A such that A (antl) = - I i. (at T) ane A, (an) - QO. Hence 


there is no l-1 mapping from the set of all Crees representing expres- 
sions involving a binary + and a variable x into {+z * which pre- 
serves the number of +'s and x's and such that the set of tree images 


is a regular set. 
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I. INTRODUCTION 


Although the theory of finite automata (or sequential machines) is 
itself only about fifteen years old, much effort has already been devoted 
to exactly what automata can and cannot do. And deservedly so, for 
several reasons: First, and perhaps the most "pure' of the reasons, is 
that an automaton, by virtue of its definition, has interesting mathe- 
matical properties. Beyond this theoretical consideration, however, lie 
some more practical reasons. Generally speaking, an automaton is the 
simplest model of a digital computer. But since the number of "states" 
in a computer may be of the order of 2 direct application of re- 
sults of automata theory to an entire computer is certainly not practi- 
cable. Application of automata theory concepts and techniques is 
restricted to systems with relatively small numbers of states (at most 
in the thousands). There are, however, practical situations in which 
this limitation in size is met. The sequential circuits which are the 
basic component: cf computers are specified by the input-output trans- 
formation which they must realize. The circuit's operation is described 
in terms of states and since it must be humanly manageable, the number of 
states cannot be too large. In fact, any computing device, organized as 
an iterative array, can be separated into smaller components which will 
be circuits with a small number of states. Automata theory also has ap- 
plications in flow-charting and program equivalence. 

Many of the results achieved concerning regularity or non-regularity 
of sets have been done in the context of equivalence relations of finite 
index and a fundamental lemma concerning the way certain input sequences 


can be separated. Only three years ago Marvin Minsky and Seymour Papert 


[10] developed a set of criteria for non-regularity based on a limiting 
quantity which seems intuitively to represent the portion or percentage 
of strings that are in the regular set. In this paper we define for 
regular sets, in terms of natural Markov processes, another quantity, 
p(A), which, generally speaking, again represents a kind of percentage 
of strings in the regular set and show that, when Minsky's limits exist, 
these two quantities agree. From the quantity developed in this paper, 
however, more information can be gathered concerning the machine which 
recognizes the regular set (and hence information about the regular set 
itself) than can be gathered from Minsky's criteria. Also described is 
a special machine which gives an upper bound for the growth rate of sets 
recognizable by a certain type of automaton. Finally it is shown that 


the sct of well-formed trees written as strings is not a regular set. 


II. GENERAL DISCUSSION AND DEFINITION 


We will give here only as much basic automata theory as is necessary for 
the completion and understanding of this paper. More detail and further 
explanation can be found in any number of texts, such as Harrison [6], 
Rabin and Scott [11], or, again, Rabin [2]. 
Before proceeding into the definition of an automaton itself, we 
shall concern ourselves first with the type of input an automaton receives. 
Definition 2.1 An alphabet a is a finite set of symbols. 
Definition 2.2 A tape is any finite sequence of symbols from the 
alphabet. Included also as a tape is the empty tape, denoted TAS 9 
which is the tape with no symbols. Defined on the set of tapes is the 
operation of concatenation, or juxtaposition, i.e., if ©“ is a tape 
and B is a tape, then x.8 is the tape formed by "concatenating"” 
CA” sand GC - Lf Ris a set of tapes and S is a set of tapes then R.S = 
{x.G eve h@and BeS § 
Definition 2.3 Re = iA § 


Definition 2.4 R* 


gn + 1 n 


Ld) 


00 

PR 

L=90 ; 

Thus we see that the set of all tapes from the alphabet , is Fx. 


With this operation of concatenation defined as above, < 5"* TAN my 
is a monoid, and in fact, y * is the free monoid generated by z ° 
For this paper we shall let S =f1,a}, although the results are 
true for any finite alphabet fy 2,80, Sek } . The prime motivation 
for this particular s is that it is the simplest case having all the 

properties of the general case. The additional fact that most real- 
life machines are binary oriented is another consideration in choosing 


the two element alphabet. Then S. * is the collection of all possible 


sequences of 1's and 2's, including the empty string A - Here the 
reader must be careful not to associate any numerical significance to 
the symbol "1" or the symbol "'2", These symbols are merely inputs to 
the machine which we now shall define. 

Definition 2.5 A (finite) automaton over the alphabet s is a 
quadruplet M = <a dos oO Ps) 7 where Q is a finite non-empty set (the 
set of "states"), do is an element of Q (the initial state), on is a 
mapping of Q x s into Q (the state transition function), and F is a 
subset of Q (the set of "final" or “accept” states). 

The state transition function S can be extended from Q x $ to 


Q x $ * in a very natural way by recursive definition as follows: 


bata oe = rg forvall q Game 
5 (q, A.x) = 6 (8 (a, ),x) for WES*, xe Z 
q € Q 


Definition 2.6 The set of tapes accepted or defined by the automaton 
M, denoted T(M), is the collection of all tapes % in s* such that 
Ss (do &% ) is an element of F. 

Definition 2.7 A set R is called regular (or recognizable) if and 
only if there exists a finite automaton M such that R = T(M). 

The preceding definition merely says that a set R is regular only 
if there is an automaton M which accepts all tapes in R and rejects all 
others, i.e., S (aps MX ) €' Fatt , Ore R. 

Recall now that va has been defined as the set dels 2f - in order 
to enable us to lend some numerical bearing to this research we define 
the following one-to-one mapping from {1, 2 { * onto the non-negative 


integers. 


: { 1, 2a} * ——» Non-negative integers 


By establishing a sort of alphabetical hierarchy between 1 and 2 in oy 


we can define the "bar" mapping as: 


A= o 


Psp S) SE EL ot ous 
il 
a 


faa 
—_ 
a" 
tb 
~~ 


Tnus we now have associated with each member of > * a non- 
negative integer, and vice-versa. We call the "bar" mapping the 


encoding mapping and its inverse the decoding mapping. 


As the last general definition, we define a natural relation in 


the monoid XS, A ~yr 
se J 
Definition 2.8 Let & Be $ *, Then a< 6 —> ges 


kj 
¥ Be oul) 


Ili. LIMIT THEOREMS 


Let A be a subset of fi, ae The question here is what kind of 
automaton could "recognize‘’ A in the sense of definition 2.7? Since for 
any regular set A there can be any number of automata M such that A = T(M), 
we shall concern ourselves only with the unique minimal automaton recog- 
nizing A and we will call this automaton Ma: 
Definition 3.1 Mn) = the cardinality of the set fae: 0% <n} 


Definition 3.2 “Lleracre {1,2 =. Thestengeh of xX, ba) is the 


ae ne } 9) 
number of digits in % . Then we have ae a ZAKS a Further- 
Leo L720 


i 7.9 a+6, aGef™ 


Definition 3.3 Ayn) = the cardinality of the set fac: L)en § 
Definition 3.4 A dead state of My is a state q € Q such that 
is CayXe yp CFF is satistree by Me “omc fy, a - Since there is 

only one dead state in the minimal automaton, this is equivalent to 
saying i ca COR) SPahior all A € fi,a}™ 

The next three theorems are generalizations and small extensions of 
the work done by Minsky and Papert in [10], and are stated without proof 
since the proofs are, with only minor modifications for theorems 3.5 and 
3.8, essentially identical with those in [10]. The first two theorems 
are concerned with the consequences of My having a dead state, and the 
last theorem indicates what must happen if M, has no dead state. 


A 


Theorem 3.5 Let M = My and suppose: 


S lags y) is dead 
| z+V¥, 
ob) § = Xt 


(c) lim i, (") - 9 


10 


it 
a 


Conclusion: 90 


From theorem 3.5 we get immediately 


Ti] 
Corollary 3.6 If {3 Batol —> a(€) for all real ¢ and if 


@(§) | = f = » then A cannot be a regular set whose minimal auto- 


maton has a dead state. 


y(n) 
Tal€n) 


case we can still sometimes use the following theorem. 


In some cases the sequence { Bate) f fails to converge, in which 


Theorem 3.7 Let OX - be the rth member of A in order of magnitude 
under the encoding mapping. That is, there are r-l members of A whose 


SE 


image under the encoding mapping is less than Of, . Then if 





Maine en 
| ic - 0 
r 00 x, 
A cannot be @ regular set whose minimal automaton has a dead state. 
AN “4N 
/ 
Theorem 3.8 If A is regular and Ma has no dead state, then IO) » 9 
where N is the number of states of Mae Thus the density of A cannot 
converge to zero, 
Combining these results yields the following criterion. 
Criterion: To prove that a set, A, is not regular, it is 
sufficient to verify condition 1 and condition 2 or 3. 
aT} 
Condition l. 1,(n) a) as naw 
n 
Tal”), 46) as n-roe,and ADH ¥ FFL 
Condition 2. 7— 7. (én) > O(E) 4 
Nats” 
Xref - Xr 
Condition 3. — ee ae 
rn 
If A is regular, then by theorem 3.8 Ma has a dead state, but by corollary 


3.6 or theorem 3.7 it has none. Thus we are led quickly to a contra- 


diction. 


yal 


IV. THE PROBABILITY OF ACCEPTANCE 


When using limit theorems as the ones developed in section three 
we run into several problems. First, of course, is the nature of the 
limiting process itself. The limit may or may not exist, and, even if we 
can establish its existence, we may not be able to determine its value. 

a7) ‘ 

Secondly, unless [14 (n) is a relatively easy sequence to recognize, we 
cannot begin to evaluate the needed sequences in the theorems. If 
in) 
coe 


lim 
n-97®@ 


strings of 1,2? * which are in the set in question, namely A, and 
2 


exists, it can be loosely interpreted as the percentage of 


hence as some sort of probability that a string is in A. If A is regular, 
this reduces to a probability that a string is accepted. The above inter- 
pretation is all on an intuitive basis and will not be subjected to any 
rigorous analysis. In this section an algorithm is given which defines 
another quantity which, again loosely, can be interpreted as the prob- 
ability that a string is accepted. 

If we assume that the set A is regular, then M, has a finite number 
of states. Consider the transition function of M,, 5 >» written as a 
transition matrix, i.e., if the machine is in state qi» then with prob- 
ability 1/2 we apply to q, an input of a "1", and with probability 1/2 
we apply an input of a "2". This is not to say that the machine is no 
longer working deterministically, but merely that the input is generated 
by a sequence of Bernoulli trials with the probability of a "1" = the 
probability of a "2"° = 1/2. If we were dealing with a k-symbol alpha- 
bet | , then the probabilities would have been l/k. By 
evaluating § (a, 51) and SCHR we can establish which states are 


accessible from qd; with an input of length one. Agsume M, has N states. 


12 


Then by evaluating $a.) and § (a,,2) fOr iar, 1222... ,N-1 and 


arranging the states of M, in a square pattern viz., 


A 


In-1 4N-2 ia Yo 


Wn-1 


and letting the q; jth entry in this pattern be the "probability" that 
§ (q, 9% ) = q; given that the length of % is one, we get a matrix 


of values that begins to look suspiciously like a Markov chain. Upon 


further inspection, we see that ali entries are either O ( 5 (4,5 ) 


q, Femeneither sles: lamer «XX =e2)xe 1/2 « 6 (a, y) = q, for either 


X =1l or A = 2), or l ¢ 6 (aj.%) =a, for both & =1 and O 
eee Cinaliypit ismnotedmthat as Gi i =] j Lana PA ecu. 
Thus we do indeed have a Markov chain. Let us cail this matrix M, since 
it describes somewhat the original automaton. Clearly this matrix does 
not define the automaton since the transition function can only be inter- 
preted generally from this matrix and we have no information whatsoever 
on which states are final states. However, we can glean some information 
from this approach. 

From Feller {3], we know that M can be divided, in a unique manner, 
>» ©, , such that from any state of a given 


acl k 


set all states of that set, amd no other, can be reached. We shall use, 


into closed sets, Cy» C 


without ambiguity, the notation e to represent both a block matrix in 
M and the set of states making up that block matrix. M can now be re- 


written: 


a, 
: (Cy) 0 o 


fj ~ 
ha 


Since each C. can be treated independently as a Markov chain (all entries 
to the right and left of each individual oe are zero entries), and each 
C. is a closed set, then each ss is ergodic and the stationary (ergodic) 
probabilities can be calculated. These stationary probabilities are 
merely the probabilities of being in the various states of ss after a 
large number of steps given that the process was already in a state of 
C,. The results of the ergodic theorem [5] make these probabilities 
independent of the starting state in a i.e., if q, © Sys and P is the 
stationary probability for state qa? then P, = noe (C."),. i all i 


whe re (C)s5 is the iath entry of the matrix C In general, M will 


kK? 
also contain transient states, i.e., states not in any ae from which 
states of the closed sets can be reached, but not vice-versa. These 
transient states are carriers, taking the process from its beginning 
to one of the closed sets. 

Let us look more closely at these closed sets oF within the context 
of the entire Markov chain M. Since each a is closed, i.e., we can 
never leave a once we are in a state contained in on each Cis i =e, 


2,0-.-,k taken as a whole is an absorbing state in the original Markov 


chain M. This means that if the process begins in some transient state 


14 


and travels eventually to a state in Cys then the process has been 
“absorbed” into never again to leave. This may be better shown if 
we let Cc, be the number of states in Ci» i= 1,2,...,k, and associate 


with M a matrix M' where 


(I) 
(I, 





The associated matrix M’' is the original matrix M with each oF replaced 
by a = x SS identity matrix i. o If we let r = ci SS ois sea ch and 


J 
let s = Ner, then M’ has the canonical form 


where I is an r x r identity matrix, 0 is an r x s zero matrix, R is an 


s x r matrix, and Q is an s x s matrix. The canonicai form of M" is recog- 


nized more readily as an absorbing Markow chain. 


Now (M‘’) gives the probabilities of being in the various states 
starting from various states after n steps. From Kemeny and Snell [7], 


n 
we know that (M') can be written 


Q" 


Ae RA ate ore I 


where RO is merely the s x r matrix of m'y® after we have blocked off 
pes 0 Q” in the manner shown. (Note that RO is not necessarily leas 
This form shows that the entries in Q” give the probabilities of being 
in each non-absorbing state after n steps for each possible non-absorbing 
starting state. By Kemeny [7], we know that the probability that the 
process will be absorbed is one. Therefore each entry in Q" must approach 
zero as n approaches infinity, which says that Q" —>Oasn 7. 
(After zero steps the process is in the same non-absorbing state in which 
it started. Hence 9° =I.) But a" —? QO is a sufficient condition for 
I-Q to be non-singular [8]. Let K = (1-9) 7}, We call K the fundamental 
matrix for the given absorbing chain. 

Following closely a procedure given by Kemeny and Snell [7], we 
will now describe an algorithm which will compute a quantity which we 
call p(A) that is an indicator of the type of automaton which recognizes 
A and represents, in a sense, the probability that a string is accepted. 


Furthermore, if A is regular, p(A) will always exist. 


16 


Given a regular set A and the minimal automaton recognizing A, Me, 
construct the state transition matrix M from the transition function. 
For ease of computation put the Go, th row as the last row of the M 
matrix. Find the closed sets, the Cis in M. Compute the associated 
matrix M’ and put M’ in the canonical form. Next compute K = (Geo, 
and let B = KR, where Q and R are as in the canonical form of M'. The 
entries in the bottom row of B (the qo row) will give us the probabilities 
of ending up in any particular absorption state. These probabilities are 
the absorption probabilities. For each or of M, sum the absorption prob- 
abilities in the corresponding I of M’. These sums are the probabilities 

| 

of ending up in any particular closed set fe Now either by computing 


lim Cae for some i and j or by solving the following system of equa- 


tions, find the stationary probability for each non-transient state. 


Cy 
Equations 4.1 (a) Pj - - pie F. 
i=1a,..4 


m=] 

j = 152,-505C; where P is the stationary 
probability for state q.. : 
J CL 

oy Z Pant 
m=] 


Note that in the above system, for each i, there are Cc. + 1 equations in 
Cc. unknowns. Equation (b) is necessary since there are only c,-l in- 
dependent equations in (a). Multiplying the stationary probabilities 

of fe by the sum of the absorption probabilities for that e yields the 
final probability of being in each particular state. Now sum these final 
probabilities over all q. € F and call this sum p(A). The procedure we 
have just described has been for the general case where do is a transi- 


ent state. If VW ig not a transient state, then this implies that M 


itself is ergodic (otherwise we would not have the minimal automaton) 
and the procedure reduces to finding the stationary probabilities and 
summing the stationary probabilities for all qs € F, i.e., solving 
equations 4.1 with k = 1 and c, = N, the number of states in My» and sum- 
ming the correct probabilities. Finaily we condense this description 
into 

Algorithm 4.2 Given a regular set A, and the minimal automa- 


ton recognizing A, M,, 


p(A)= 2 Fa 


Pa: (4 (KR), }: Ps 


eg 


where 


and P, is found by solving system 4.1. 

At this point one may ask what if a transient state is an element 
of F? But since a” —» 0, the probability of ending up in any transient 
state is zero, and hence this state would add nothing to p(A). As an 


example for algorithm 4.2, we will compute p(A) for the following 


automaton: EKO. do? é ne a where 


~~ 


Q = fa9+ 4494724394 ¢ ee cea ny and b (aos) = do» 

(do22) = dy § (4,51) = das Ja, 52) = dos S(ays1) 
5 (4552) = qos 5 (ays) = dy § (45.2 = 3s baz) = dys 
$ (4,52) = 43> 


M for this automation is, 


18 


{4 bs b2 i if 
hi kg O80 10 \ 
y, Se WD me 
M= O 100 |% 
O /) I, O 0 ik 
\2 Oh hk 0 | g 
[hod ba iinld 
C ( 7 (1) 
ll ly -(f : 
R Pay; 6 _—_ 
[1 O "fh 
-qek -/, 7 K-(I-Q) /2 | 


Finally we calculate, 


WPT Y Oh h\ [0 hk hk 
AIN\O 0 RJ \O A A 
cae ci 


Looking at the bottom row of KR we find that the probability of ending 


up in C, is 0 + 1/4 = 1/4, and the probability of ending up in C, is 3/4. 


1 2 


Next we compute the stationary probabilities in C, and C and find that 


1 2 

the stationary probabilities for Io» 435 and q, are 1, 1/2;. and 2 
respectively. Therefore, the probability of ending up in qo is 1°3/4 = 
By q3 is 1/4°1/2 = 1/8, and in q, is 1/4*1/2 = 1/8. Summing over 
the states in F, namely q5 and q,2 we find that p(A) = 3/4 + 1/8 = 7/8. 

From the definition and construction of p{A) we obtain the follow- 
ing theorem. 

Theorem 4.3 If A is regular and p(A) = 0, then My must have a dead 


state, Furthermore, the dead state is the only absorbing state in My 
and the transition matrix M is an absorbing Markov chain with one 
absorbing state, namely the dead state. 

Proof: First of all, the assumption that p({A) = 0 implies that the 
matrix M itself cannot be ergodic. If M were ergodic, we could reach 
every state from every other state and from Fisz [5] we would have all 
stationary probabilities positive. But p(A) = 0 implies that the station- 


ary probabilities for states of F are zero. Hence, we have a contradic- 


tion. Therefore, consider the transition matrix of My in the block form 


20 


C) 0 


(4) 0 
fy > 
fo > 


Let us first consider each C Again from Fisz since each C, is a closed 


k° k 


set and every state in C, can be reached from every other state in C 


k 


k? 


all stationary probabilities in each Ch are positive. Since the machine 
is minimal, each state is accessible which means that the probability of 


1s positive. Hence, no state in any C, is in 


being absorbed by each C k 


k 


F because otherwise p(A) would be positive. This means that we have 


remaining only C, 's with states not in F (and there must be at least one 


k 


such C. since the probability that the process is absorbed is one). 


Since these C. "s contain only states not in F and are closed, i.e., if 


q éC, feet 2}* > oy (g, a) é i , each of these states 


is dead. Because M, is the minimal machine, there can be at most one 


dead state. 
Corollary 4.4 If A is regular and p(A) = 0, then for every 


Ox ef Bef*> 5 (.,4-8)= 9, where dp is the dead state of Mae 


Proof: From theorem 4.3 we know that the transition matrix of My 


has the form 


ZI 


where the lone absorbing state is the dead state. We know also that the 


probability that the process is absorbed is one. This means that from 


any state of Ma we must be able to get to the dead state. Therefore, 


ror ay 68" 18S" 9515Gp.),)-5Q,,2.8)= Yo - 
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V. THE AGREEMENT OF THE PROBABILITIES 


?,(n) 


Thus far we have presented two quantities, lim |W 407 » and 
Nn-~w 
p(A), both of which seem intuitively to represent the probability that 
a string is accepted. On the one hand lim y(n) exists for some 
N77 Nn 


regular and some non-regular sets, while p(A) exists only and always 
for regular sets. In order for the conclusions drawn from both these 
quantities to be truly valid and meaningful, they should agree where 


they both exist. In this section we show that this is indeed the case, 

4P4(n) 

ener 
fT 


i.e., if A is regular and lim = & , then J = p(A). Before 


N90 
we can show this, however, we need the following lemma. 


“)\ 
Lemma 5.1 If T(n) 5 6 » then Ran) >. 
f) en 
Proof: Suppose fain) doesn’t converge to O » Then J E>O 
oe da(n) 
such that for infinitely many n ee ? QO + € (or <O-€ ae 


Let M be this set of n’s. Now let Xn = whe (the string, not the 


OC. _ 
number). Since Saat —> [= so IN, 2 fOmall n @ cAY 





gn 

on, E 

—_ - Lae - similarly, since _ 3 

on On tL | - 74 a Zz, IN, 

> ‘ ail € 
for alln 2 N, Fe = bum re | < lat » From the ie IN, 3 
for alln 2 N, | Tal& wy, ) | Dams . Choose n >N + 1 where N = 
3 [| -— -@ Ti 

max (N, > Nos N.) and such that n € M. Again let @&%, = 2. Now 


Mal%) My) + dal) 1s (Fees) + daln) 


a, x as is y 
J ES 





23 





; I, (n+) ’ A, (n) 


7 | “gn -Z 


¢ f | le 4 
Al(Q-a)tl 4-7 








since 0< Ma(Kot) ¢ 1 | ia SAV =? 


Xr 


a a) | Le 
oY, 


€ 
it | «4 








= 


TD, (Gn- Da(So-1) _ Da (%o-1) | < Wy 


Yn a 





Similarly, since 0S ae ss ia A >M => 
Ain) {| 2" — 4e 3 
=> = Ph ae "4 < 4 


Aa (n) _ de (n) | < § 
a2 a bP - /4 
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Therefore, forn > N+ 1 


= . ae _ Wy + dala) _ Y 





Thus for yo many n, 


Bi.) » +/e- ga YFOTE- %, | 


= +/ 26 + | « o+% 
Tan) 


Theorem 5.2 If A is regular and Agln) => Gg » then 0 = p(A). 





But this contradicts the hypothesis that converges to 3 ‘ 


“gn 
Proofs First we define naln) to be the cardinality of the 
© porefi, at*: 509, a) + Ga. and Ver)=n§. Then clearly 
1) A oP (n ) » Lf we consider the transition matrix of 


Ma we = two cases. 
Case 1. M itself is ergodic. Then the probability of capture in 


¥ n | 
state q, in n steps is dan =(M he - Taking limits on both sides 


we have, from Fisz m4 dan) fa (IM fin 7 =a » where Po 
= n> 0G 


” pape =e 
is the stationary probability for state a Finally, 
a fim Ayla) = lim §" aatn) 
. es or [)-? 00 
fae 


2> 


Case 2. M is not ergodic. In this case, as in algorithm 4.1, we 


rewrite M in the form 





If q, is not in any Cis then Aa(n? = (Q") andi een ),(n) 
0 


n 
lim (Q") = 0. Hence we can eliminate consideration of all 
nN>0 Oa 

states of F which are also transient states. Assume now that q, 6 Co 


Let T be the set of transient states. Then the probability of capture 


in state a in n steps is 


Ale). 5 £1E 2) (8), Mn"), 


Fix t € C, ands €T J CR), a0. Ue 3) = 0, then this path 
adds nothing to the sum of all paths which have length n and end in 
state q.. iGe (R) + = 0 for all s and t, then we do not have the mini- 
mal machine, since this would mean none of the states in CS are access- 
ible. Therefore, there exists at least one combination s, t such that 
(R) 6. f 0. Let P be the probability of capture in state q, inn 


steps via this certain path. Then 
P= 2 (a). (R) 
er a ai M }, 
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(Are e000 


We know the following things: 


ay A Z (94) =(T-Q).. for all s 


N—-7 ® 
from Finkbeiner [4]. 


OP 
-/ (( k 
Furthermore (T-Q) »- “A , ae: » Or else state qd, would not 
=O 


be accessible. 


- ; n Poe 
(ii) Since (M en (Cc. es 


- r? _ 
(iii) Ct (Q ye = 0 fOmnali s 


Given €>Q0 ‘ 


(a) Choose N. = efor = N. 


2 (@), (2-9 |, 


(b) Choose N, Y forn 2 N,, 


|(1”).,- Pal < At-b),. 


(c) Choose Ny Jforn 2 
iy A £ 
< eee einen somerset I 


Gz max {(4°), (My oo (m4) § 


“Be l(a * (Rhee (M*), +(9"") (dt) | 
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where 


D,*[(a"™) (Rt -+(@), Roe | 


By (c) since n-N, > No 


O< 6,4 5 (Q*) (k).°G 
b: 1-N, : 


< N,- @EN, GN, Rie G= 


Then, using (b) also 


Dy pa- D ED g cfs €+D, ps eer 


By (a), since n- (N., + 1) aN +N, 2 N, 


-f i% & 
Dape-|(T-9), "Ra lea, < 


Ty ee 
P,< Dn patét]@ 0), ‘i ear 


or 


D,pa-€- Fs Pa Dypat act § 


Where 





RS 


(Ti i (T-Q),, Rh 


Since £€ can be made arbitrarily small, we have 


+ 


barn L =e Dy fe “iee| 7 (@*). (Ra. | 8 


N=» Oa 


: (I-0),, (R) Pe -), (R). fre 


And finally we arrive at 


fim Jaln) hm gf fale) 
N—-> &B a fi -> & a 


foe F 


= Z 7 Va Fade 


foeF Lo ascii 


oo $- F tow Price 


foe fel SET 


2 7 7 (kh (KR), P 


fac F LEC set 


pf) 


The reader is advised to compare this with algorithm 4.2 in which we 
described the construction of p(s). ) 
AG 

Theorem 5.3 If A is regular and —— 3 O, thei 9 = p(A). 


Proof: Apply lemma 5.1 and theorem 5.2. 
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VI. THE FIBONACCI MACHINE~-A GROWTH RATE UPPER BOUND 


For the proof of the next theorem, it becomes necessary to look at 


a particular machine of N states, one of which is a dead state, which 


we shall call My: Described formally, My = <Q, dos a . F? where 


Lo fag: q,9 Ios eee dy sf 


dies { ao» Tyrese f 


and the transition function is given by 


5 (a, 52) = Io > fi a O-line. . ,N-2 
§(a,.) = W547? 1 = Ceres .N-2 
§ Cay 42) = Wy 


ra xe 
OM 1. ) _ Wel 


It is obvious from the definition of the transition function that Wwe] 
is the dead state and is the only absorbing state. Hence p(M,) = QO, 
(here we have substituted the name of the machine for the regular set 
in the argument of p(-), but the meaning is clear.) It is more helpful 


to look at My as described by its transition tree graph given below. 
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We have labeled the entire tree as Tg and the N-1 sub-trees as shown, 

(Henceforth we will omit the machine or regular subscript on yi since 

it is clear what machine we are talking about.) Forn < Nel, vielen) = 

2" since all xefia} * of length less than N-l are accepted. For 
Nol 


n = Nel, At) = a since only & = 1 is not accepted. Now for 


n > Nel, since the T i = 1,2,...,N-l, are mutually disjoint, 


Ain) = azn) ied. t)tdz, (n-a)t-+ A _(n-(w-1)) 


where Ap) is the number of idk of length n Pricted counting 
the length from qT, as a base. Furthermore, since each Th. taken sepa- 


rately, is identical to T o° each T, alone recognizes the same set as 


Tos which means that ly (n) = be (n)= an) ,4 af sony N-{ and that 
Ma) = An-t)+ Maa) Ma-(WD) , newt 
which we recognize as an N-l term Fibonacci sequence. For N=3 we get 


the familiar two term Fibonacci sequence 


Aw) = 1 


AQ) = 2 
A(z) = 3 
Aa) = 5 
Aw = 8 


° 
°o 


Because of tne type of growth of Any we call My the N state 
Fibonacci machine. Since we hawe a Fibonacci sequence, from Alfred [1], 


we may calculate 
; Am (n+t 
k= ey N 
ae My (n) 


by solving the polynomial 
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Pula tte) Sey 120 


Py Od is continuous and P (9) = -l, P (2) = 1, which means that 
there exists a solution of Py = 0 that is greater than zero and less 


than two. Furthermore, since for x > 2, 


a ; “2(4)° z tet toy 


we have 


ee ee ae ee ae 
=> NL >NE ENE +. pe NEEM 
a> NL ONDE +(N-A)L 4. + AUT 
=> NE (WDE - (WA) + -Q¥-1 >0 


Therefore all solutions of P tx) = 0 are strictly less than two, which 


means that 
him An Any, (net ram, 


prod Ray Ai 


Hence for any k, 


bime Aa) — fm (dn lnk) ise 
nc Un) N> Am (ntkt) ala. (n) 


An, (nek) Am Am, (41, (n+1) 
= aa (nvk-1) mae Ann) gh 


by 


With the help of this definition and discussion of the N state Fibonacci 
machine we can prove 
Theorem 6.1 For every £ > ©, there exists a finite automaton 


(and hence @ regular set A) such that 


Aglnti) 
we aie o 


where 


A-ES O<K 


Proof: Since for every E€ > 0 


00 | dhe ss 1 

ae = a-€& — Pa 
7 (x2 ik sat Soe Lae é } 
bz1 JE 


choose N sufficiently large so that 


Seay 


Form the Fibonacci machine for this N. From the preceding discussion, 


fam Any (nt1) a 2 


namely the fact that pm»a Ame (1) (n) 
K 


ficiently large n, dn nti) < Lttw(n) I and hence that 
N+l vi < 
An Any lr) mo) of of 
“gn | 


LLY 57 = (2-0) Woe) --eedrt >(2-6)" 
hz] 





we have, for suf- 


Secondly, 


=> (9-¢)'-(a-€)"- + -@b)-1<O 


| a 
= P, (2-6) <0 
Since Py 6?) = 1 >0 for any N (and Pts) is continuous) 


6 = Lavy Amy lit (nt) | > 2- ry 


a RE cnitiy Anu (N) 


The Fibonacci machine is especially interesting since it seems that 


for any other machine, Mas with N states and such that aa(n} —= 6 
an 


4 
ae (n) = Aug(N) for ali n. Aithough not rigorously proved 


here, this conjecture lends itself to an induction proof on n that would 


3 


proceed first from the fact that for n ¥ N-1 the conjecture is obviously 
true ( Amy (2) = on n < Nel and Ray) = 2°.) forn = N-1) 


since the dead state must be accessible in M 


A? the other machine, in 


N-l or less steps, and then an argument showing successively how My must 


have the minimal number of elements in state qn is the dead 


2° (4y.y 
state), W-32°°° 994 at each level less than n, and hence the maximal 
number of elements in state q, at each level less than or equal ton, 
thus implying that at level n +1, A ayltt) 2 Any (n+1) - Intuitively 
speaking this conjecture is not hard to accept since in My there is one 
and only one way to get to the dead state and this path is as long as it 
can possibly be. Furthermore, all other states besides the dead state 
are accept states. 

With this conjecture in mind, the Fibonacci machine gives a growth 
rate upper bound for machines which have only one absorbing state, namely 
the dead state. This growth rate upper bound is seen in theorem 6.1 to 


be arbitrarily close to two, but always less than two. That is to say, 


for any machine M which has only the dead state as an absorbing state 


lim An (nt1) < 
n> ae (n A. 
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VII. THE SET OF TREES AS AN INPUT 


For all automata previously discussed, the input has been a one 
dimensional tape, i.e., a string of symbols. A tree automaton is essen- 
tially the same as an ordinary finite automaton except for the input 
which is, as the name implies, the set of well formed trees. In order 
to describe this set we need the following definitions. 

Definition 7.1 A ranked alphabet is a pair ais oS? where A is a 
finite set of symbols and oy e. > /h/ » the set of non-negative integers. 


If we further define 


A, ~{aeA 0 (a)= 0} 
AY “facA:O(@)-1f 


o 


° 


a= fash: o(a)=R§ 
then A = A, VA, U:- UAR 


Definition 7.2 OQ isa free iff WV éE Aa or X= & 

where fen andmme, © A, f= tm co oot! ce 
n i 1 
tits ts 

The set of all well-formed trees is a regular set in the sense 
that it can be recognized by a tree automaton. The question here is: 
Does there exist a one-one mapping from the set ot well-formed trees to the 
set A* such that the number of times an element of A appears in the 
tree is the same as the number of times that that particular element 
appears in the string which is its image and such that the image of the 
set of trees is &@ regular set in the sense of definition 2.7? For simpli- 


city, and again without loss of generality, we consider the case where 


tae 
i 


= ft, L$ and OOjeR ‘ Tatia= 0 - The condition imposed upon 
the one-one mapping reduces to saying that if a tree has k +'s and (k + l) x's 
(all trees must have one more x than +), then the see which repre- 
sents that tree must also have k +'s and k + 1 x's. An example of a tree 


for this A is + 


Let Y be the set of trees over 44,73 » Assume ¢ is a mapping which 
; ¥ 
satisfies the above conditions, i.e., Q : ¥ > fxs 2 TM) = OW) 
for some M (we assume M to be minimal) and the strings in ~p (Y) have 
the correct number of appearances of each element in them. From Knuth 
[9} we cam that the number of trees with n +'s and n + i x's is 
a 

c+ aa Let B = g (Y). The restrictions on Q give us 

Me fon) O and A, (antl) = Ca for ~/ new Using Stdr limes 


approximation to the factorial, i.e., n.% Joan (2) > we find that 





a 
n (Nt) an 
Then an 
Aglantt) = Cn wy (Atl) Jen — 0 
a atl ¢/ gantt 


and 


dg (an) O 
gan “pen > 0 
Therefore, 
ds(n) _. 9 => p(B)=0 
M, must then be a machine with only one absorbing state, namely the 


dead state. But if we look at the by 


a(antt) — Gm ron eee 
Ag (an-D) ila 4 (me 


i) 


we see that the growth rate is too large for a machine of this type. 


Hence, a contradiction and so no such mapping exists. 


hy> 
Sena 


VITI. SUMMARY AND CONCLUSIONS 


Looking back now on the problem of section 7, we can see why some- 
thing more than the criteria given in section 3 was needed to solve it. 
First of all, so little was known about how the supposedly regular set 
B behaved that it would have been difficult to decide even if lim Tefn) 

N>g 
existed. Secondly, although theorem 3.7 could be applied, it was not 
a sharp enough criterion to give any useful information. Essentially 
all that was known was that d z(an)=0 and de (ane) = - However, 
with this new concept of p(B), and knowing theorems 4.4 and 5.2, we 
could deduce information about what type of machine would be needed to 
recognize B, namely a machine with a single absorbing state -- the dead 
state, Next, with the Fibonacci machine providing a growth rate upper 
bound for this particular type of machine we could show that Aen) exew 
too fast for this type machine and hence B could not be regular. This 
problem was especially difficult, since the growth rate of A,(n) was 
just barely greater than allowable and a reaily fine line had to be 


drawn to mark the cutoff point. Therefore, p(-) is an improvement over 


e “ip (n s 8 e Ld ° 2 9 LJ e 
lim Btn) in that it is a more specific indicator giving more in- 
N-9@ 


formation about the machine, namely not only the existence of a dead 
state, but that the dead state is the only absorbing state. Coupling 
this with the "maximality"’’ of the Fibonacci machine yields another 


criterion for showing a set to be non-regular. 
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